A Reasonably Language Independent, Heuristic Algorithm for the Marking of Names in Running Texts

نویسنده

  • Benny Brodda
چکیده

0. Introduction T h e id en tifica tio n o f n am es an d abbrev ia tions in a ra w te x t is a n im p o rtan t sub ac tiv ity in the "to k en iza tio n " p ro cess , i.e . th e id en tifica tion o f th e b as ic u n its o f th e tex t: parag raphs, sen ten ces a n d w ords. T o k en iza tio n is in its tu rn an im p o rtan t su b ac tiv ity in th e "norm alization" o f tex ts , th e to ta lity o f o p era tio n s and p repara tions a te x t has to u n d erg o b efo re it is su itab le fo r be in g ad d ed to a te x t co rpus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genetic algorithm for Echo cancelling

In this paper, echo cancellation is done using genetic algorithm (GA). The genetic algorithm is implemented by two kinds of crossovers; heuristic and microbial. A new procedure is proposed to estimate the coefficients of adaptive filters used in echo cancellation with combination of the GA with Least-Mean-Square (LMS) method. The results are compared for various values of LMS step size and diff...

متن کامل

An Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ

An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...

متن کامل

An Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ

An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...

متن کامل

A Literary Anthroponomastics of Three Selected African Novels: A Cross Cultural Perspective

Names as markers of identity are a source of a wide variety of information. This paper explores the names of characters to show the sociocultural factors which influence the choice of names and the effects that the names of these characters have on the roles they play. Using a variety of personal names from Ayi Kwei Armah’s Fragments, Buchi Emecheta’s The Joys of Motherhood, a...

متن کامل

Dynamics of a Running Below-Knee Prosthesis Compared to Those of a Normal Subject

The normal human running has been simulated by two-dimensional biped model with 7 segments. Series of normal running experiments were performed and data of ground reaction forces measured by force plate was analyzed and was fitted to some Fourier series. The model is capable to simulate running for different ages and weights at different running speeds. A proportional derivative control algorit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998